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Abstract 

An aperiodic and irreducible Markov chain on a finite state space converges to its station¬ 
ary distribution. When convergence to equilibrium is measured by total variation distance, 
there exists an optimal coupling and a maximal coupling time. In this article, the maximal 
coupling time is compared to the hitting time of a specific state or set. Such sets, named 
halting sets, are studied in the case of symmetric birth-and-death chains and in some other 
examples. Some applications to the cutoff phenomenon are given. These results yield new 
methods to calculate cutoff times for some monotone birth-and death chains without the 
lazy hypothesis . 
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1 Introduction 

Let X be a finite space, {Xt)t>o ^ discrete or continuous time irreducible Markov chain on X. 
Let TT be its stationary distribution . For t > 0, denote by tt^ the distribution of Xt. Let d be a 
distance between probability distributions on X. For t > 0, denote by d{t) the distance between 
the distribution of Xt and tt: 

d{t) = d{TTt,TT) 

For A C X, denote by Ta the hitting time of A. 

Definition 1.1. A set A C X is a halting set for the distance d and the Markov chain {Xt)t>o 
if for all t>0, 

d{t) < F{Ta > t) 


If the Markov chain is also aperiodic in the discrete time case, dit) goes to 0 when t goes 
to infinity. The question of non-asymptotic behavior is hard. There are several techniques for 
obtaining explicit bounds on d{t) : Fourier analysis, coupling, strong stationnary times, ... 

Several of them use hitting times. 

Strong stationary times are allied with separation distance, s{t) = max^, ex(l-^). Ta 
randomized stopping time is a strong stationary time if T and Xt are independent and if the 
law of Xt is given by the stationary distribution tt. In [5], Aldous and Diaconis prove that there 
exists a strong stationary time T such that for all t > 0, s{t) = P(T > t). Furthermore, in [7], 
Diaconis and Fill construct an absorbing dual Markov chain X* such that T is the hitting time 
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of an absorbing state for X*. This is used by Diaconis and Saloff-Coste in for birth-and-death 
chain to prove some cut-off results. 

Let T be a strong stationary time. A state y* is a halting state for T if A* = y* implies T < t 
or equivalently if T < Ty*. Furthermore, if there exists a halting state for T, then s{t) = P(T > t) 
for all t. This is an example of the use of hitting times to obtain bounds on a distance. 

In coupling methods, hitting times are very often used to bound P(r > t) where T is the 
coupling time. Several examples can be found in the book of Levin, Peres and Wilmer [17] p.65. 

In [in], Martinez and Ycart also use hitting times to obtain bounds on the total variation 
distance. They prove that in general, the access time to equilibrium and the hitting times tend 
to be equivalent, if the process starts “far away”. But the context is different, they consider 
continuous time Markov chain on a countable set I and they study ||5aLt ~ when a goes 

to infinity. 

In this article, mainly the total variation distance is used, that is: 

d{t) = Ikt - ttWtv ■■= sup |7rt(A) - 7r(A)| = -^ V |7rt(a;) - Tr{x)\ 

Find some bounds on d(t) is equivalent to bound the mixing time of the chain. For e £]0,1[, 
denote by defined by 


t-mixi^) = > 0/d(t) < e}. 

In [IT] (theorem 10.14 p 134), the autors established the following theorem for a lazy chain : 

— 2rnax[Ta;] -|- 1 

4 x£X 


In fact, they prove that d{t) < y where ttq = S^. 

In [in]. Ding, Lubetzky and Peres prove the following result for a lazy birth-and-death chain 
on {0,..., N} started at 0, 

\/0 < I < N, Vn e N, ||7r„ - ttWtv < ^{Ti > n) + Tr{{l -|- 1,..., N}) (1) 

So it is a natural idea and often used method to compare the access time to equilibrium with 
the hitting time of a given state. This state will depend on the initial state. 

One object of this article is to look at “minimal” coupling time for the total variation distance, 
mimicking in some meaning what is done for the separation distance. The construction of Aldous 
and Diaconis of a randomized stopping time satisfying s{n) = P(T > n) for all n leads naturally to 
a strong stationary stopping time. So rewriting what can be found in m and El], a randomized 
stopping time T is constructed satisfying d{t) = P(r > t) for all t and the properties of the law 
of {Xt,T) lead to the following result: 

Theorem 1.2. Let M. = {x & Xj Mt > 0, TTt{x) < 7r(a;)}, then M. is a halting set for the total 
variation distance and the chain (Xt)t>o- 

In the following, we omit to say for the total variation distance and the chain {Xt)t>o- Of 
course if a;* G A4, {x*} is a halting set and so for all t > 0, d{t) < > t), a, property to be 

compared to O. A such state is called a halting state. 

The difficulty then will be to hnd halting sets. The first fact is that if we have some property 
of monotonicity, a halting set must exist (proposition (13.21) . 
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Halting sets can be used to prove some results about cutoff. This concepts was intoduced 
by Aldous and Diaconis in [I] to describe the fact that many ergodic Markov chains converge 
abruptly to their stationary distribution. 

Consider a family (^n^^)n>o of aperiodic irreducible Markov chains on a finite state space 
each with its stationary distribution Tr^^^and its distance from stationary tpf is a 

cutoff if 

d^^\[ctN]) —> 0 if c > 1 
1 if c < 1 

Diaconis and Saloff-Coste in verified this conjecture for continuous-time birth-and-death 
chains, started at an endpoint, with convergence measured in separation. 

Ding, Lubetzky and Peres in m proved it for continuous-time birth-and-death chains and 
lazy discrete-time birth-and-death chains, started at an endpoint, with convergence measured in 
total variation. For such chains, the fact that the product of the mixing time and the spectral 
gap tends to infinity is equivalent to the fact that the product of the expected value of the hitting 
time of the median of the stationary distribution and the spectral gap tends to infinity. 

The cutoff depends of the distance. For example, the Ehrenfest process starting at 0 on 
{0,..., N} has a separation cutoff but a total variation and cutoff. The biased 

(p; q)-random walk starting at 0 on {0,..., N} has a total variation and separation cutoff 

but a 2 ( 1 - 2 ^) N cutoff. 

Recently, Basu, Hermon and Peres in [1] , study the link between cutoff and some concentration 
of hitting time of “worst” sets of stationary measure at least a, for some a €]0,1[. They prove 
that in the case of a lazy reversible irreducible Markov chain, cutoff is equivalent to a notion of 
cutoff for hitting times, denoted hita-cutoS. Here, we use explicit hitting time, the hitting time 
of a halting state for total variation. 

Recall the definition of a cutoff window : 

If (wn) and (In) are two sequences such that wn = o(tN), one may define that the family of 
chain {Xn^'^)n>o exhibits a cutoff at with window wn if 

lim..y_»+oo hmjv_^_)_^ - ywAr]) = 1 

lim..y_>+oo limAr-i.+oo + jwn]) = 0 

A general result can be the following in the discrete time: 

Proposition 1.3. Let be a halting set such that a{Ty^(N)) = o(]E[r, 4 (Ar)]) . Let A^(Ar) be the 
largest eigenvalue of the restriction of the transition kernel of the chain to 'K \ . 

// (1 — A^(N))E[r^(N)] goes to infinity when N goes to infinity, then the chain has a cutoff at 
time E[T^(iv)] for total variation. 

If furthermore, lim fl — A^(n) )cr(T 4 (jv)) > 0, then the chain has a cutoff at time E[r^(iv)] with 
window a(Tj^(N )) for total variation. 

The rest of the paper is to determine halting set in several examples, essentialy in the case of 
birth-and-death chains. A Markov chain on {0,..., 2N + 1} on {0,..., A^} with transitions given 
by K = (k{x, y))x,yex is symmetric if for all x,y € X, k{N — x,N — y) = k{x, y). An example 
of result is the following: In the case of a monotone and symmetric birth-and-death chain on 
{0,..., 2N -I- I}, A^ -I- 1 is a halting state for total variation. Furthermore, if 1 = Aq > A^"^) > 

2N+1 ^ 

• • • > are the eigenvalues of the transition kernel and if (1 — ^ goes to 
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2N+1 

infinity when N goes to infinity, then the chain has a cutoff at time ^ for total 

fc=i 1“ \ 

variation. 

The paper is organized as follows. In section 2, the construction of an optimal randomized 
stopping time for the total variation distance for discrete and continuous times markov chains 
is given. Section 3 exposes the notion of a halting set for total variation for a chain with an 
initial distribution and gives applications to cutoff. In section 4, general results about monotone 
birth-and-death Markov chain are given and the case of symmetric monotone birth-and-death 
Markov chain is studied. These results are used to prove cutoff in several examples. 


2 Optimal stopping time 

2.1 Construction of an optimal stopping time in discrete time 

This construction is not new but the presentation made here is different from Lindvall’s proof 

in pT| . 

Let {Xn)n>o be a Markov chain on a finite space X with a stationary probability tt such that 
for a\\ X & X, Tr{x) > 0. 

The transition kernel is denoted hy K = {k[x, y))x,yex- Let ttq be the distribution of Xq and 
for n > I, 7r„ = be the distribution of X„. Let = ||7r„ — ttHtv be the total variation 

distance between 7r„ and tt. 

We define for n > 0, 7 „ : If —>• R+ by 


7 o(a;) = TTo A ^{x) and for n > 1, 7 „(x) = (7r„ A Tr){x) — (7r„_i A 7t)K{x) 


As d{n) = I — ^ {TTn{x) A 7r(x)), for all n > I, d{n — 1) — d{n) = ^ 'yn{x). 

x^X x^X 

By consequence, we are looking for a randomized stopping time T which satisfies for all n > 0, 
for all X € X, P(Ar„ = x,T = n) = 7 „(x). 

Proposition 2.1. Let A„ be defined by 


( Ao = TTo 

\ A„+1 = TTn+i - (7r„ A Tr)K 


Let be defined for all n > 0 by 


f’nix) = 


*/ (x) 0 

1 ifAn{x) = 0 


( 2 ) 

(3) 


For all n > 0, tpn takes value in [0,1]. 

Let (Un)n>o be independent variables uniformly distributed on [0,1], independent of {Xn)n>o- 
Let T be the randomized stopping time defined by T = inf{n > 0 / Un < 'i/'n(A’„)}. 

Then Vn > I, Va: € X, 

P(A„ = x,T = n) = ynix) (4) 

P(X„ = x,T > n) = A„(x),P(X„ = x,T > n) = 7r„(x) - (tt^ A tt){x) (5) 

P(T >n) = d{n) (6) 
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Proof. As 0 < 7 „ < A„, fjn takes value in [0,1] and T is well defined. 
We prove ([5]) inductively : 


Ao(a;) = 7ro(x) = P(Ao = x,T >0) 

P(A „+1 = X,T > n+ 1) = = y^^n+l = X,T > n,Un > f^niy)) 

= Eyex(l - ipn{yW{Xn = y,T> n)k{y, x) 

= ((1 - ll}n)Xn)K{x) = (A„ - tl}nAn)K{x) 

= (A„ - 'yn)K{x) = An+l{x) 

The second equality conies from the Markov property and definition of T. m and dni) are easy 
consequences of 

□ 


Remark 2.2. ipn is given by the following formulae: 


f^nix) = 


Injx) 

A„(x) 


■n/\Tr{x)-(TT„-l/\Tr)K{x) -f A / X / ^ 

—(^Tn —lA7r)iC(x) ^ ^ 

1 i/A„(x)=0 


(7) 


Remark 2.3. A n alternative definition can be made for the randomized stopping time T. 

ITe define for all n > 0 

n 

Ju=Y[{l-MXk)) ( 8 ) 

k—0 

and T = inf{n > 0/ U > J„} where U is a random variable uniformly distributed [0,1] indepen¬ 
dent of (A„)„>o. 

Remark 2.4. To study distance from stationarity for Markov chains, separation distance has 
good properties. 

The separation distance is defined by 

s{n) := sep{TTn,TT) = sup s{n,y) where s{n,y) = \ - 

yGX T^[y) 

and satisfies d{n) < s(n). 

The following result was established by Aldous and Diaconis (1987) : 

• If T is a strong stationary time, for all n G N, s{n) < P(T > n). 

• Conversly, there exists a strong stationary time T such that there is eguality for all n £ N. 


The construction of the stochastically optimal strong stationary time is the following : 

7 „(x) = (s(n — 1) — s{n))7T{x), so as before ln{x) = s{n — 1) — sin) 

With the method of proposition with i(o(.x) = (1 — s{0))^^^ on the support of no, we 
obtain P(A„ = x,T = n) = {s(n — 1) — s(n))n(x). So T is an optimal strong stationary time. 

Remark 2.5. We see in the preceding remark that in the cases of separation and total variation, 
we can construct a randomized stopping time T such that d{n) = P(T > n) for all n £ N, the 
construction depends of the choice of jn such that ln{x) = d{n — 1) — d{n). 


5 






For example, if for the total variation distance, we take "fnix) = {d{n — 1) — d(n))TT(x) as for 
separation, we can prove that it does not exist ipn with values in [0,1] which satifies Anifn = 7n- 
We can deal with others distances and the first condition is to have n —> d{n) decreasing. 

For any convex function f : R+ —>■ R, one may define the f-divergence distance on P(£') by 

These measures are studied in m by Liese- Vajda. 

We suppose that /(I) = 0, so (i/(/i,7r) > 0. 

Some examples : 

• The total variation distance given by f{x) = ^\x — 1|. 

• The relative entropy given by f{x) = a;ln(a;). 

• The Hellinger distance given by f{x) = ^(1 — v^)^- 

• The distance given by f{x) = ^(1 — x)^. 

We note df{n) = (i/(7r„,7r). 

Let 7 „ : X —R 6e defined by Xnix) = ))K — f{J^)'x. So dfin — 1) — df{n) = 

^^ 7 „(a;). The convexity of f implies that 7 „(x) > 0. 

a:GX 

In all generality, df(n) can be superior to 1 and so it will be impossible to find a randomized 
stopping time T such that for all n> 0, F{T > n) = d{n). 

So a constant c > 0 and a function ipo ■ ^ ^ [0,1] have to be found such that for all n > 0, 
P(X„ = x,T = n) = = tjjr,{x)An{x). 

If we want P(T < +c») = 1, it is necessary to choose ipQ such that 

1 = p(r = 0) + ^ p(r = n) = ^ Mx)Mx) + ^d/(o) 

n>l x^X 


By mimicking the preceding calculus, for all n > 1, 
An(x) = TTnix) - 

So the condition A„ > -^n implies that 


(V'oTTo) + -/(—)7r 

C TT 


+ -/( —)7r(x) + -7n(x) 
C TT C 


f{ — )'X > -CTTn + Ici'lpQTTo) + /( —)7r 
TT L TT . 




In the case where ttq = Sxq, this condition becomes /(—) > /(0)(1 — —). 
But as /(I) = 0 and f is convex, if < 1, < (1 - 7j|^)/(0)- 

This implies that //[0, 1] is affine. 

In our examples, the only one is the total variation distance. 


2.2 Construction of an optimal stopping time in continuous time 

Let Q = {q{x, y)){x,y)exxx be the generator of an irreducible positive recurrent Markov chain on 
the finite space X. 

As before, ttq is a probability measure on X. {Xt)t>o is a Markov chain with initial distribution 
ttq and generator Q. 
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We write ttj = ttq exp(tQ) for the distribution of Xt- 

Let TT be the unique stationary distribution, tt is the unique distribution satisfying ttQ = 0. 
We write d{t) = ||7rt — TrUry. 

Let {Jt)t>o be defined by 


f 0 if 3s e 7rs(Xs) < 7r(Xs) 

(1 - MXo))exp (/q jX?-ig) rfs) else 

where i/jq is defined in (|31). Let 17 be a uniformly distributed on [0,1] random variable independent 

of {Xt)t>o- 

A randomized stopping time T is defined by T = inf{t > 0/ U > Jt}. 

Proposition 2.6. 


vt > 0 , d{t) = P(r > t) 

Vt > 0, P(T < t, Xt = x) = {iTt A tt) (x) 

\/t > 0,F{T < t, Xt = x) = (iTt A tt)(x) — f (tTs A 7r)Q(x)ds 


Jo 

The distribution v of {T,Xt) on (R+ U {+oo}) x X is given by 

iy{ds, x) = (tto a Tr){x)6o{s) + {l^^^^)^^(^^'jTTsQ{x) - (tt^ A TT)Q{x))ds 
The proof of the proposition (EH) can be found in part (5.2) of the appendix. 


( 10 ) 

( 11 ) 

( 12 ) 


(13) 


2.3 Some remarks about weak and maximal coupling 

In the preceding sections, the construction of an optimal randomized stopping time T such that 
for all n, P(T > n) = d(n) is given. It is easy to see that by the same way, a randomized stopping 
time T for a Markov chain {Yn)n>o whose initial distribution given by tt, can be defined such 
that P(T > n) = d{n) and P(A„ = x,T = n) = F{Yn = x,T = n) = 7 „(x). 

So ((A„)„>o, T) and {{Yn)n>o^ T) is a weak optimal coupling as defined in El]- Then Lindvall 
prove that it exists (X, Y) a coupling of {X, Y) such that the coupling time T of (X, Y) has the 
same distribution as T. 

Furthermore, the coupling process (X, Y) can be chosen such that it is a time inhomogeneous 
Markov chain. The transitions can be computed but are quite complicated. It is not necessarily a 
Markovian coupling. By definition a Markovian coupling ((X„, y„)„>o of (ttq, X), (tt, K) satisfies 
the following property : 

VX, under P( |(Xfe, Yfc)o<fe<Ar), (Xvr+n, YAr+„)„>o is a coupling of (ttat, X), (tt, X) 

This is sometimes called causal or co-adapted coupling. These ones do not alway^exist. ^ 
Furthermore, it can be proved that T = min(5, 5) where S = inf{n > 0/7r„(X„) < 7r(X„)} 
and S = inf{n > 0/7r„(F„) > 7r(F„). 

Here is an example of what is obtained for the two-state chain in continuous time. 

X={0,1}, (5=( ^ Y 0 = A-I-/i and TTo = pJo 3-g^i. 

\ d d / 

The stationary distribution tt is given by tt = and suppose that p > ^. 

We have tt* = (f + £^e-‘«)5o + (| - 2^e-*®)<5i and d{t) = 
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The initial law of the coupling is given by what is called the 7 -coupling of (tto , tt) in [5T] , 

P((Xo, To) = (0,0)) = ^(0), P((Xo, To) = (1,1)) = ^(1), P((^o, f^o) = (0,1)) = P - ^(0) 

We find in this case an homogeneous Markov chain on X x X with generator given by 
g((0,1), (1,1)) = A, 9((0,1), (0,0)) = g((l, 1), (0,0)) = p, g((0,0), (1,1)) = A 

The distribution of the coupling time T* is {q + ^)6o + {p — ^)£{9). 

In |12j . Fill finds that in this example, the optimal coupling time for separation is given by 
an exponential random variable with parameter 6. 

3 Halting set for total variation distance 

3.1 Proof of theorem II.2L existence of halting set 

By the preceding constructions, for all t > 0, dit) = P(T > t) = ¥(U < Jt). By definition of Jt 
®, ®, {U < Jt} C {Tm > t}. SoT< Tm- 

Remark 3.1. How can we find a halting state ? 

Suppose that our Markov chain is a reversible aperiodic Markov chain. 

Let |X| = TV -b 1 and let 1 = /Sq > f5i >■■■ > > —1 be the eigenvalues of K with 

L'^{tt) -normalized eigenvectors Vqi • ■ •; ■ The spectral decomposition gives: 

I \ ^ 

-1 = ^PkC^My)yk{y)Vk{,x)) 

Denote by p = max(|/3j|, 1 < T < TV). So if an halting state x* exists, it must satisfy 

E E 'Xo{y)Vk{y)Vk{x^) <Q and ^ ^ 7 ro(y)t 4 (y)t 4 (a;,) < 0 (14) 

fe/l/3fc|=py6^ k/\Pk\=P 

Of course, the set Ai can be empty. Property of monotonicity can imply the existence of 
halting state, as proved in the following proposition. 

Let ^ be a partial order on X. 

A Markov chain on X with transition kernel given by TV = {k{x,y))x,yex is monotone if for 
all X < y, the probability TV(x, •) is stochastically smaller than TV(y, •). That means that for an 
increasing function /, Kf is increasing. 

Proposition 3.2. Let K = {k{x,y))x,y^x a monotone kernel for a partial order on X. Suppose 
furthermore that there exists a smallest element denoted by 0 and a largest element denoted by 

1 . 

Then 1 is a halting state for the Markov chain started at 0. 

Proof. ^ is decreasing. We prove inductively that for all n > 0, ^ is decreasing. 

It comes from the relation Thi±i = x( — ). 

So if ^(1) > 1, for all a: € X, 7 r„(a:) > 7 r(a;) that is false. □ 

Remark 3.3. In the continuous case, the hypothesis is changed to the following, for all t > 0, 
the kernel Pt = is monotone. 
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3.2 One example: The riffle shuffle 


The well-known rifle-shuffle is a method of shuffling cards. Its mathematical description was 
made by Gilbert and Shannon (see m) and independently by Reeds [20] . A sharp mathematical 
analysis for the riffle shuffling was carried out by Bayer and Diaconis (1992) [^. 

Denote by (crn)n>o this Markov chain on the symmetric group &n- cto is the identity and 
the stationary measure is given by the uniform one. 

Let f{a) be the number of rising sequence of a. In [5], corollary 2. it is proved that {f{(Tn))n>o 
is a Markov chain on X = {1,..., N}. Its stationary probability is given by z/(r) = where 
A]\[^r, the Eulerian number, is the number of permutations with r rising sequences. 

If we denote by the law of /(cTn), we have ||7r„ — 'kWtv = \Wn — 

fN + 2'^-r\ 

I N ) 


In [^, theorem 3, it is proved that P(cr„ = a) = 


if f{a) = r. 


So a halting state for total variation for the chain (/(crn))n>o is a state r such that for all 

/iV + 2”-r “ 

n > 0, '' 


/iV + 2"-r\ 

I N ) ^ J_ 

- ■ ^ — N\ ■ 

We can see easily that 


is a halting state for total variation. 

Furthermore {f{<Jn) — l)rt>o is a Markov chain on {0,... ,iV — 1} which is symmetric, i.e. 
k{x, y) = k{N —1 — x,N—1 — y). We shall see later that for symmetric birth-and-death chain, 
the middle element is often a halting state. 

If for is the set of permutations with more than rising sequence then we have ||7r„ — 

ttWtv < P(Tot > n). 

We deduce using Theorem 4 of ( jSj) that F{Tw > n) > 1 — + 0/ ^ 

/ X 1 dt 

e~~^ _ 

-oo V 

So for example, lim 


(^!-r) where 0 < c, 

ATI 


■log2(Af5) 


> 1 . 


3.3 Applications to the bounding of the total variation distance and 
to problems of cutoffs 

To prove the proposition 11.31 the following results found in the book of Aldous and Fill are 
used. 

They are given in the discrete time. 

Let P be an irreducible transition matrix on a finite space X with a reversible distribution tt. 
Let A C X and = P/A‘^ x A'^. Let be the largest eigenvalue of P"^ and Ai the second 
eigenvalue of P. P^ is supposed to be irreducible and aperiodic, then: 

1-A^ >7r(A)(l-Ai) (15) 

The proof can be found in |3], Theorem 33, Corollary 34, Chapter 3 Reversible Markov Chains. 
For all n > 0, 

^ATa >n)< 7r(A-)\^ (16) 

The proof can be found in [3], Proposition 21, Chapter 3 Reversible Markov Chains. This is 
an application of the Perron-Froebenius theorem to P"^. 

These results are also used in m and g]. 

The third useful result, seen in g] is the following : 

Vx € X,yn,m, Vx{Ta > n + m) < - ttWtv + ^tv{Ta > m) (17) 

We prove this result for clarity: 
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Proof. Let {Xn)n>o be {Sx,P) Markov chain. Let n > 0. 

We can find a random variable U independent of {Xn)n>o and a function of Xn and U denoted 
by Yn such that the law of Yn is tt and \\SxP^ — 'n’Wrv = ^{Xn ^ Yn). It is a standard argument 
of coupling. 

We have Pa;(r^ > n + m) < P(X„ ^ Yn) + F{Ta > n + m, Xn = Yn). Using the Markov 
property, we have 


F{Ta > n + m, Xn = Yn) = E[lr^>„,x„=Y-„P('U 4 (W) > n + m|cr(Xfc, fc < n) V cr(C/))] 

= ¥\lTA>n,Xn=Yj^Xn{TA{X) > m)] 

So ¥x{Ta > n + m) < ||4P” - ttHtv + E[Py„ (Ta(X) > m)]. □ 

Proof of proposition 11.31 

The proof is similar to these found in the paper of Ding, Lubetzky, Peres ( m) and of 
Diaconis and Saloff-Coste ( 0) or in g]. 

Bounding the total variation from above For all n > 0, S^^n) < F{Ta(n) > n). So for 
all 7 > 0, 


(i^^^(|'E[T^(iv)] + 70 ’(^A('v))l) < P(T^(iv) — E[T^(iv)] > 'ycr[TA(N))) < ^ ^ 


Bounding the total variation from below By (ITTl) . 

df'^\n) > P(T^(iv) > n + m) — FTr{TA(N) > m) 


Let 0 < e < 1 and n = [(1 — e)E[T^(iv)]] and m = [|E[T^(jv)]]. Then by (ITfi)) , 
F^iTAW >m) < (1 - Tr{A^^')))e-^^^-^Am^ 


So 


P(T 4 (n) > n + m) 




> 1 - 




-fE[T^(Ao]) 


D’ou dW([(l - e)E[r^(«)]]) > 1 - 

If n = [E[T^(n)] — ^<j{Ta(n))], we obtain using the same inequalities, 

d!'’^\n) >l-\- 
7 


Remark 3.4. If lim > 0, the condition (1 — )E[T^(Ar)] —>■ +oo can be changed to 

(1 — A 2 ’^^)E[T 4 (iv)] —>■ +00 by the property hl5\) . 
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4 Birth-and-death chain on = {0,. .., N} started at 0 

4.1 General results 

For every birth-and-death chain on {0, • • • ,N}, we will denote 

Px = k{x, a;-|-l) ifO<a;<-/V—1 

qx = k{x, X — 1) if 1 < X < N 
Tx = k{x, x) if 0 < a; < TV 


We have for all 0 < x < Pa; -I- r^; -|- = 1. We suppose the Markov chain irreducible, so for 

all 0 < X < — 1 , Pa: >0 and for all 1 < x < A^, (/a, > 0 . 

The chain is started at 0. For x, p £ Tx is the hitting time of the state x and Tx,y is the 
hitting time of y by the chain started at x. 

To use proposition (1131), it is necessary to know the spectrum of the transition matrix. So 
below, is given a result without eigenvalues. 


Proposition 4.1. Suppose that x^ is a halting state for total variation for the chain started at 
0 such that E[Ta;j;^] goes to -|-oo when N goes to -|-oo and such that <j{Tx»^) = o(E[Ta;j;^]). 

Suppose also that there exists y < x^ such that E[ra:^] ^ Tr^^^dO,..., p}) —^ 0; 

Then the chain has a cutoff at time ElTa,* ] for total variation. 

If furthermore E[ry^a:* ] = 0{a{Tx» )), then there is a cutoff at time E[ra:* ] with window 
a{Tx*^) for total variation. 


Proof. To bound the total variation distance from below, the following inequality is used : 
For all n > 0, d^^'^{n) > PlTy > n) — : v})- 

lfn=[il-e)E[Tx.J, 

nTy >n) = 1 - P (^E[r,] -Ty> E[r,^](e - (1 - |^))) 

1 _ _ 

“‘N 


So > 1- 


- e|t I - 7rW({0:---:2/})- 


According to the hypothesis, it is clear that lim — e)E[Ta;* ]]) = 1. 

AT—>-+cxD ^ 

If furthermore E[Ty,a:;,] = 0{a{Tx‘^)), if 7 big enough, 

1 


dW(E[7;.]-7a(T,.))>l- 


(7- 


E[t, 


y-Nl-^2 


-^W({0,...,p} 


dr.. ) 


(18) 


□ 


There are several methods to calculate E[ra:] and a{Tx) for a birth-and-death chain. The first 
one uses the spectrum with the following formulae, see [16] : 


X—1 


X—1 


1 - Ofc ’ (1 - 


k^O 




(19) 
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where ao > ■ • ■ > o^x-i are the eigenvalues of ■ This implies the following inequality: 


a(T,)2 < 


1 


-E[T, 


1 - ao 

The second one uses the following formulae : 

X—1 X—1 

E[Ta;] = ^E[ry,j^+i] and a(T^f = 


( 20 ) 


( 21 ) 


y=o 


IDT- 1 ’r({0)---.4) ^2 

^ ’ ^{'^x,x+l) — 


y^O 

2 ^("/n ^,\^2 


Px7t{x) 

Sometimes, it is possible to calculate E[r 2 ,] by using martingale. 




Hx)Px ^ PyT^iv) 


4.2 Monotone birth-and-death chain on = {0,..., A^} started at 0 

A birth-and-death chain is monotone if for all 0 < a: < iV — 1 , -|- qx+i < 1- 

In this case if ttq = Sq and as usual, 7 r„ = ttqK'^, then for all n > 0, a: i—> ^^{x) decreasing 
and by consequence A^ is a halting state for total variation as proved in proposition (13.21) . 

An example of monotone birth-and-death chain is given by birth-and-death chain with positive 
spectrum and in particular by lazy chain which are chain with r^, > ^ for all x. 

Remark 4.2. With the notation of the section 0 it can be proved that in the case of a mono¬ 
tone birth-and-death chain on {O,--- ,N} started at 0, the support of the distribution of Xt is 
{ 0 ,... ,a;*} where x* is the smallest halting state for total variation. 

+ 00 

Indeed, P(At = x) = ^nix). 

n—0 

So P(At = 0) > 70 (0) > 0, P(Xt = 1) > 71 (1) = Po A 7 r(l) - po 7 r( 0 ) > 0 
By iteration and by using monotonicity of x ^ '^f!‘{x) > 

Vn > 0, 7 ra(a:) = 0 Vn > 0, 7 r„(a: — 1) < 7 r(a: — 1) 

By consequence, the support of the distribution of Xt is {0,..., a;*} where for all n > 0, 7 r„(a;*) < 
7r(x*) and there exists n > 0 with 7 r„(a:* — 1) > 7 r(a;* — 1). 

So x* is the smallest halting state for total variation. 

The proof is the same in continuous time using fl0). 

There is an interesting property satisfied by monotone birth-and-death chain. With the 
notations of mi 

Lemma 4.1. If P is a monotone irreducible birth-and-death matrix with eigenvalues given by 
1 = Po > Pi > ■ ■ ■ > Pn > -1, then \Pn\ < Pi- 

Proof. The spectrum of an irreducible birth-and-death chain satisfies 1 = Po > Pi > ■ ■ ■ > Pn, 
so p € {Pi, |/3Ar|} 

We have seen by monotony that for all n > 0, < 1. But 

N 

^-1 =Y,P^V,{0)Vk{N) 

k^l 

N-1 

= PYVi{0)Vi{N) + P%Vn{0)Vn{N) + ^ PlVu{f))Vk{N) 
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But by using property (1301) of the appendix on the eigenvectors of irreducible birth-and-death 
chain, we have that Vi(0)Vi(A^) < 0 and (—l)^VAr(0)VAr(A^) > 0. 

So if I/JtvI > /3i, — 1 ^ /3]^VAr(0)y/v(-^) and we have a contradiction with the fact that 

for all n > 0, - 1 < 0. 

By consequence I/Jat] < /3i. □ 

Example 4.3. Metropolis chains 

Let TT be a probability on = {0, ..., N} with tt{x) > 0 for all x € . Use the Metropolis 

algorithm with base chain the simple symmetric random walk to obtain a birth-and- death chain 
with stationary measure tt (see e.g.( JE/))- By construction, it is a monotone chain and so N is 
a halting state. 

Example 4.4. Simple random walk 

Consider the birth-and-death chain on = {0,... ,N} started at 0 with p^ = p, 
ro = q and r^ = P- We suppose 0 < g < p < 1 and p + g = 1. It is a monotone chain, 
a halting state. 

The spectrum of this kernel is known, see m for example. 

/ 

The eigenvalues are (3k = 2y^cos( for 1 < fc < iV and 14 is proportional to 

V 

where Xk,i = - ^/qsm{^). 

Some calculus prove that is satisfied if < —(1 — 2q) 

If xn is the smallest integer such that xn > N -\-1 — cot“^((l — 2q) cot{jf(p^)), 
xn ~ fV + 1 — which is close to N for N large enough. 

It can be proved, using methods of martingales that for all 0 < x < N, 


Qx = q, 

so N is 

Xkfi \ 
Xk,N / 

we have 


E[r,] = ^ .M - i^r) a^{T^) =Ax + B + + Exi^f 

p-q [p-qY p p p p 

with A = o,nd B,C,D,E some constants depending of p. 

So proposition ^4 Al l implies that there is a cutoff at time t^ with a window a^, where tff = 
and (7at = VYl. 

To prove this, one can take y = N — \/N. 

If P = 9 = 5 ) we have E[Ta;] = x(x + 1) and a'^{Tgf) = +2x-i) ^ 

It is well known that in this case, there is no cutoff. 

We can prove that in this case + 1 is the smallest halting state for total variation 

for the chain started at 0 , but the condition a(Tx*^) = o(E[Ta;j^]) is not satisfied. 

Remark 4.5. We can suppose that p depends of N. 

As we have seen above, the smallest halting state for total variation for the chain started at 
0 is bigger than iV + 1 — cot“^((l — 2q) 

Let (5 > 0 and pn = 5 + ;^ and qN = ^ 

So the smallest halting state for total variation for the chain started at 0 belongs to {[y]) . ■. ,N}. 
Denote it by x^. As in the case above, the condition <j{Tx‘^) = o(E[T 2 ;j^]) is not satisfied. 

Remark 4.6. If we suppose that pm = \-\- cn and qN = \ — with cn > 0, cat = o(l) and 
Ncn —t +00. 

lEe have cr'^iTN) ^ and E[TAr] - So o-(TAr) = o(E[rAr]). 
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Furthermore it is easy to see that y = N — y satisfies the hypothesis of proposition ^. 1\ ) , 
so we have a cut-off. 

4.3 The case of symmetric birth-and-death chain 

Definition 4.7. A Markov chain on \ = {0, ..., N} with transitions given by K = {k{x, y))x,ye^ 
is symmetric if for all x € y € then k{N — x,N — y) = k{x, y). 

Two different cases can occur, N odd or N even. The spectral analysis differs in these two 
cases. 


4.3.1 The case of symmetric birth-and-death chain on {0, ..., 2N + 1} 

Denote by P the transition matrix of the birth-and-death chain, by Q the restriction of P to 
{0,...,iV}x{0,...,7V}. 

Let L be the {N -I- 1) x {N 1) matrix define by Lij = 1 ii i j = N and Lij = 0 if not. 


So P is the tridiagonal matrix given by P = 


Denote by Qi the matrix equal to Q unless the entry Qi{N, N) that is Qi{N, N) = r^ -\-pN- 
So Qi is a stochastic matrix. 


/ 

Q 

\ 

PN 

Qn+1 



LQL 

V 

/ 


Proposition 4.8. If P is monotone, then iV -|- 1 is a halting state for total variation distance 
for the chain started at 0. 

Proof. If {Xn)n>o is a birth-and-death symmetric chain on {0,..., 2iV-|-l} with transition matrix 
given by P then the process (Zn)n>o given hy Zn = N -h ^ — \Xn — (iV-|- i)| is a birth-and-death 
chain on {0,..., N} with transition matrix given by Qi. Its stationary probability is given by tt 
with Tr{x) = 27r(a;). 

And if P is monotone then Qi is monotone too. So for all n > 0, < 1- 

Also for all n>0,x i—>■ is decreasing. So 

r,P"(0,Af-l-l) ^ P"(0,Af) I P"(0,Af-l-l) 

^ 7i-(Ar-|-l) — Tr(N) 7r(Af+l) 

< ^ (P(A„ = N)+ P(X„ = N+1)) 

<^P(Z„=iV) = 2«M<2 


□ 


Proposition 4.9. If for all x G {0, ..., 2N 1}, = 0 then for all n> 0, for all k > 0, 


P"(0,jV + l) 
2tt{N + 1) - 


and 


P^{0,N + 2) 
2Tr{N-G2) 


< I 


Proof. Denote by Q 2 the matrix equal to Q unless the entry Q 2 {N,N) that is Q 2 {N,N) = 


TN -PN- 

We have the following easy lemma : 
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Lemma 4.2. If X is an eigenvalue ofQi associated with the eigenvector v, then X is an eigenvalue 
of P associated with the eigenvector 

If X is an eigenvalue of Q 2 associated with the eigenvector v, then X is an eigenvalue of P 
associated with the eigenvector 

Denote by 1 = /3o > /3i > • • • > /Sat the eigenvalues of Qx. 

Suppose N = 2a, the proof is similar if N is odd. 

The formula (1551) of the appendix tells us : 




N 

p2"+i(o, N + l) = 2n{N + 1)(1 + ^ /3fwith i/fc > 0 

k^l 

If we denote for 0 < A: < a, X2k = Pk and for 0 < A: < a — 1, A2fc+i = \PN-k\, the sequence 
{Xk)o<k<N by property ([5^ of the appendix, is decreasing and 

N 

p2n+i(o^^_^l) ^ where B 2 k = {-^)^X 2 ki'k and B 2 k-\-i = i-l)^~^^X 2 k-\-iJ^N-k 

k^l 

So B 2 kB 2 k+i < 0 and B 2 k+iB 2 k +2 > 0. The following lemma gives the result. 

2d 

Lemma 4.3. Let ct^Bk where 

k=0 


Bo > 0, Vi B2iB2i+i < 0, B2i+iB2i+2 >0, oo — 

We suppose that Tq = Ti = • • • = T^-i = 0. Then n i—>■ T^ 
The proof is in part (5.3) of the appendix 


1 > oi > 02 > • ■ • > a2d > 0 (23) 

is stricly increasing on {d, d+ 1, • • • }. 

□ 


4.3.2 The case of symmetric birth-and-death chain on {0, ..., 2N} 

Denote by Q+ the stochastic matrix on {0, ..., N} x {0, ..., N} defined by : 

f Q+ix, y) = P{x, y) if 0 < x,y < N - 1 
\ Q+{N,N-l) = 2pM, Q+{N,N) = rN 

Proposition 4.10. If P is monotone andpN-i < then N is a halting state for total variation 
distance for the chain started at 0. 

Proof. If {Xn)n>o is a birth-and-death symmetric chain on {0,..., 2iV} with transition matrix 
given by P then the process {Zn)n>o given hy Zn = N — \Xn — A^| is a birth-and-death chain 
on {0,..., N} with transition matrix given by Q+. Its stationary probability is given by tt with 
7:{x) = 2'k{x) if X < — 1 and Tt{N) = 7r(iV). 

And if P is monotone and pn-i < xm then (5+ is monotone. So for all n > 0, ^ < 1- 

But Q’l(0, N) = F{Zn = N) = F{Xn = N) which gives the result. 

□ 
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Lemma 4.4. It exists 71 ,... , 7 Ar such that for all n > 0, ^ = 1 + Pk"/k where 1 

Pi > ■ ■ ■ > Pn are the eigenvalues of Q+. 


> 


k=l 


Proof. If A is an eigenvalue of Q associated with the eigenvector v, then A is an eigenvalue of P 

( ^ 

associated with the eigenvector 0 

\-Lv^ 

If A is an eigenvalue of Q+ associated with the eigenvector ( , then A is an eigenvalue of 


P associated with the eigenvector ( a; | . 

^Lvi 

Spectral properties give the result. 


□ 


Proposition 4.11. If all the eigenvalues of P are positive then N is a halting state for total 
variation distance for the chain started at 0 . 


N 

Proof. By using the lemma above, for all n > 0, = 1 + ^ Pklk- 

k=l 

So the result comes from the following lemma (I4.5|) proved in the appendix with r„ = ■ 


d 

Lemma 4.5. Let 1 = Aq > Ai > ... > A^ > 0 and r„ = ^ AiXf > 0 


i=0 

If for 0 < i < d — 1, ri = 0, then n —> r„ is increasing and so for all n > 0, r„ < Aq. 


□ 


Proposition 4.12. If for all x € {0,..., 2N}, = 0 then for all n > 0, 


If 2k > N, 


P""(0,2fc) ^ ^ 

2TT{2k) - 


If2k+1>N, 


p2"+i(o,2A: + 1) 

27r(2fc + 1) ~ 


Proof. We define two stochastic matrices Qe and Qo respectively on {0,, N} x {0,..., N} and 
on {0,..., - 1} X {0,..., iV - 1} by 

Qe{x,y) = P'^{2x,2y) and Qo{x,y) = P'^{2x + l,2y + 1) 


These stochastic matrices have positive eigenvalues, so they are monotone. Their stationary 
measures are given respectively by Tre{x) = 2t:{2x) and tTo{.x) = 27r(2x+ 1). They are symmetric 
so we can apply proposition (14.81) and proDOsition (l4.11l) . 

The relations P^”(0, 2x) = <5"(0, x) and P^”+^(0, 2a: + 1) = Q"(0, x) give the results. □ 


Proposition 4.13. Let P be a transition matrix of a birth-and-deadh Markov chain on "K = 
{0,...,2iV} be given by P = rl -\- (1 — r)P where for all x € X, P(x,x) = 0. We suppose 
0 < r < i and P monotone. Then for all n > 0, P”(0, N -\- 1) < 7r(7V + 1). 
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Proof. It is an easy consequence of the above proposition (14.121) . 

For all n > 0, P” = ^ {1 - r)^P'^. Let 2a; + 1 e X, 2a; > TV. 

Then p2"(o^2a; +1) = V (" - rf’^+^P^’^+\0,2x + 1). 

fe =0 

So 2a; + 1) < 7r(2a; + 1)(1 - (2r - < 7r(2a; + 1) 

The same method gives that for 2x € X,2a; > TV, P^”+^(0,2a;) < 7r(2a;). 

Using monotonocity, we have the result. □ 


4.3.3 Continuous time symmetric birth-and-death chain on X = {0,..., TV} 

Proposition 4.14. is a halting state for total variation for the continuous time Markov 

chain started at 0. 


Proof. Let Q be the generator of the markov chain. Q is supposed symmetric, Q{N — x,N — y) = 
Q{x,y). 

Let q>2 max{q(a;), a; G X} and P = I + ^Q. 

P is the transition of a symmetric birth-and-death chain on X which satisfies Tx = P(a;, x) = 
1 — > ^. So this chain is lazy and we can apply Dror)osition (l4.8|) and nroDOsition dd.llIl which 

say that for all n > 0, P"(0, 


But for all t > 0, 7rt([^]) = e‘'3(0, [^]) = e"?* 


+ 00 


(tqy 


n—0 


P"(0, 


rTV-t 1 


]) < ^([ 


N + 1 


])■ 

□ 


4.3.4 Applications to cutoffs for symmetric birth-and-death chain 


Proposition 4.15. Under the hypothesis that the birth-and-death is symmetric on {0,...,A} 

^ 1 

such that x’y = -|- 1 is a halting state and (1 — Ai) --- > -too, then the chain has a 

, ,1 Afc 

E[Px* ] cutoff for total variation. 

^ 1 

Furthermore E[T 2 ;j;^] ~ ^ - 

fc=i 

^ > 0, then the chain exhibits a cutoff at time E[Ta,* ] with a 


1 — Afc 


N 


//liminf(l - Ai)^^ 
window a{Tx* ). 




( 1 - A ,)2 


N 


Furthermore in this case, the chain exhibits a cutoff at time ^ - — with a window 




N 

E' 


Afe 


Here 1 = Aq > Ai > 


> Xn o,re the eigenvalues of the chain. 


Proof. One has to prove that the hypothesis of this proposition imply these of the proposition 

m- 
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By m and using the notation of the appendix, 


^ 1 
k=0 ^ 

But as 7r({A^ + 1,..., 2iV + 1}) = i, by (HI]) < 1 - 7 o < 1 - Ai. 

By lemma (14.21) and (|5T|) . for all 0 < fc < iV, A 2 fc = Pk and A 2 fc+i = Ofe. 

So by (HTT]) . (1 — Ai)E[T/v_|_i] tends to +oo. 

The proof is almost the same in the case of a symmetric birth-and-death chain on {0,..., 2A^}. 

□ 


4.3.5 Two examples 


The Ehrenfest process The Markov kernel of the Ehrenfest chain on {0,..., N} is defined 
by 

N — X 1 X 


Px = 


Qx — 


A^ + 1’ + iV + 1 

The stationary distribution is given by 7r(x) = and the eigenvalues are A^ = 1 — 


0 < k < N with L^(7r)-normalized eigenvectors 


Vkix) = 


-i fc 




i=0 


N -X 
k — i 


In this case |AAr| = Ai. It was proved in lemma (14.11) that for a monotone chain, |AAr| < Ai. 
We can search candidates to be halting states by da for the chain started at 0 : 


Ei(0)Ti(x)±yiv(0)IV(x) <0 


2a; > iV + 1 


So the smallest candidate is + 1- 


If N is odd, we can apply Droposition (l4.8l) ans so is a halting state for total variation 
for the chain started at 0. 

If TV = 2N' is even, we haven’t qn'+i < 'Cat' so we can’t apply proposition dd.lOp . 

But P = r/ + (1 — r)P with r = and P{x,x) = 0 for all x. So we can apply proposi- 
tion (l4.13|) and so TV' + 1 is a halting state for total variation for the chain started at 0. 

So a;AT = [y] + 1 is the smallest halting state for total variation for the chain started at 0. 


N 


N 


N 


As 


I] ttit: = X! fc ^ I] 


Afc 


fc=i 






(1-Afe)= 


^ 1 

■Er 


iV+ 1 ^ fc 

fe=l 


^ 1 ^2 N ^ 

> Tw — I proposition (14.151) gives the existence of a cutoff at time A > - 


fVlniV 


fc2 ' 6 

fc=i 

with a window N. 


fc=i 


Aa: 


Bernoulli-Laplace models Consider two urns, the left containing r red balls, the right N — r 
black balls with 0 < 2r < N. At each step, a ball is picked uniformly at random in each urn and 
the two balls are switched. The process is completely determined by the number of red balls in 
the right urn and this is a birth-and-death chain on X = {0,..., rj.The stationary distribution 
is 
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and for x G X, the rates are given by 


(r — x)(N — r — x) x^ 

Px = - 7T7 -^-, Qx = 


r{N — r) ’ r{N — r) 

The eigenvalues of this chain are well known (see m or i) and are given by 


f3N,r,i = 1 - 0 < I < r 

r[N — r) 


( 24 ) 


The first eigenvectors are 


Nx 


Vq = 1, Vi(x) = (1 - where Ci is defined by the condition ||foi||L 2 (^^^^) = 1. 


The sign of foi(0)Vi(x) is that of 1 — so the halting states belong to {xjx > 

In the symmetric case where r = a. halting state must be > ^. 

Proposition 4.16. is the smallest halting state for total variation. 

Proof. Unfortunatly, this chain is not monotone {po + qi > 1) so the proposition dd.lOl) cannot 
be applied. 

Consider the case of N even, and change N to 2N. 

We use the notation of part (3.3.2) and of proposition (14.1011 . The result will be true if is 
monotone and x !->■ is decreasing. 

Indeed, for all n > 0, then x is decreasing and so P^”+^(0, N) = 

Q'^ff+^{0,N) <n{N) = Tr{N). 

In the same way, using decreasing of x i—>■ ^{x), for all n > 0, P^"(0,iV) < tt(N). 

So first, we have to prove that for all x, for all ?/, 

Qlix, {0,..., y}) > Q^(x + 1, {0,..., y}) (25) 


We have that for all — 2 > x > 1, Px + qx+i < 1 and pn-i + Sgvr < 1, so if x > 1, 
Q+(x, {0,..., y}) > Q+(x + 1, {0,..., y}). This inequality is also true if x = 0 and y > 1. 

So if X > 2, 

Qlix, {0,..., 2 /}) = qx{Q+{x - 1, {0,..., y}) - Q+(x, {0,..., y})) 

+ (1 - Px){Q+{x, {0,..., ?/}) - Q+{x + 1, {0,..., y})) 

+Q+{x + 1, {0,..., y}) - (5+(x + 2, {0,..., 2 /}) 

-\-Q+{x + 2, {0,..., 2/}) 

> qx+i{Q+{x,{0,...,y}) - Q+{x + 1,{0,.. .,2/})) 

+ (1 - Px+i)Q+{x + 1, {0,..., y}) - (3+(x + 2, {0,..., 2/}) 

+Q+{x + 2, {0,..., 2/}) = Q+(x + 1, {0,..., 2/}) 


It remains to prove (|25ll for x = 0 and x = l,y = 0. 

Q+(0, {0}) = poqi + rg = > riqi = Q+(l, {0}) 

Q+(0,{0,1}) = qi + ri 

<3^(1, {0,1}) = qi + ri(l -pi) +piq 2 = qi + ri + pi{q 2 - ri) 
Q+(1,{0}) = riqi 
Q+(2,{0}) = qiq2 
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So the condition is satisfied if <72 < fi and this the case for the Bernoulli-Laplace chain. 

q3 ( 0 ,x) 

Now it remains to prove that x 1 —>■ is decreasing. 

Denote 7f„ = Q" (0,.). 

We have by reversibility that ' 


So by some calculus, we find 


-{x) = '^^{y)Q+{x,y). 

TT 

y 


- gl +^1 +Plg2 _ riq2 + ^2^2 _ 92^3 

n ^ ^ n ^ ’ ?i(l) n ^ ^ ni{l) n ^ ’ ni{l) 

So we have to verify if ri> qi+ rf +piq 2 > riq 2 + g 2?'2 > 9293 - It is true if N is big enough. 

Consider now the case where N is odd and replace N by 2N + 1. 

By the same ideas, it suffices to show that Qi and are monotone where Qi is given in 
proposition (14.811 . In fact Px + qx+i < 1 if 1 < cc < 2N — 1, but using symetry, P‘^{2N +1 — 
x, {0,..., y}) = 1 — P^(x, {0,..., 2N — y}), so there is no additional verifications to do. We 
conclude as in proposition (14.81) . 

□ 

By the same method as used for the Ehrenfest chain, using (14.161) and the knowledge of 
eigenvalues given by (El, the symmetric Bernoulli-Laplace birth-and-death chain on {0,..., N} 
exhibits a cutoff at time with a window N. 


4.3.6 Asymmetric continuous-time Erhenfest process 

Let Zt = ..., be a continuous-time Markov chain on (Z/2Z)^ where 

are iid continuous-time Markov chain on Z/2Z with infinitesimal generator given by 
We have 

Po(^i = 0) =ft(0,0) = T^ + 


l<i<N 

-A A 


A+/J. A+/4 

= 0) =P*(1,0) = ^ - 

= 1 ) = p,(l, 1 ) = 


(26) 


N 


If Xt = \Zt\ then (Xt) t>o is the asymmetric continuous-time Ehrenfest Markov 


i—1 

chain on {0,..., N}. 

Furthermore if 0 < i, j < X, 

p,(Xt=j)= 

k+t=j 


N-i 

I 


Pt(l,l)'=Pt(l,0)*-Vt(0,l)'pt(0,0) 


N-i-l 


(27) 


The stationnary distribution is the binomial distribution B{N, j^)- 

So = (1 -I- ^ 

So it is easy to prove that the minimal halting state is ^ = |" . 

Proposition 4.17. The asymmetric continuous-time Erhenfest process has a cutoff at time 

logjN) 
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Proof. The smallest halting state is given by ^ = |" . Let y* = N — x*. 

We have to prove that the hypothesis of are satisfied. 

Denote by Qa./^ the generator of our chain and by A the restriction of Qa./x to {0,..., xJ ^ — 1} 
and by B the restriction of Qfj.,\ to {0,..., y* — 1}. 

For all 0 < * < iV, * + 1) = \{N — i) and qx,fj.{i, i — 1) = yi. 

Let (Pq'^, ,..., the sequence of orthogonal polynomials associated to Qa./x- 

The eigenvalues of —Qx.fj, are tk = k{X + p), 0 < k < N. 

If L is the {N + 1) x {N + 1) matrix defined by Lij = liii + j = N and Lij = 0 if not, then 
k'Q x.^ki — Qfi.x- 

So by an argument of symmetry, for all 0 < < A^, for all 0 < x < iV, = 

p^^{tk)p^’>^(tk). Furthermore, by ([301), {-l)'^p^'^(tk) > 0. 

Let Va = 0 < ao < «! < • ■ • < ctx*-i be the eigenvalues of —A. 

Let Vb = 0 < /3o < /?! < ■ • ■ < be the eigenvalues of —B. 

Lemma 4.6. For all 0 < i < N — 1, the interval [ti,ti+i\ contains a single element o/VaXI Vb- 


Proof. It is a consequence of Sturm’s theorem : The number of sign changes in the sequence 
PQ’^(t),Pi’^{t),... ,p^^(t) gives the number of eigenvalues > t. The convention is that if 
Pk+i{t) = 0 and pkit) 7^ 0, the sign of pk+i{t) is equal to the sign of pk(t)- 

Apply this to t = —ti : 

The single change of sign can occur in the sequence ti), ti),..., p^f'{—ti), and 

in this case , —ti < —oq < 0 

or can occur in the sequence (p^f(-ti),...,p^^(-ti)) = -(p'^'^{-ti),..., p'^ty^(-ti)), 
and in this case —ti < —(3 q < 0. 

This method is reiterated. □ 


y*-i 


But Eo[T^:^] = XI ^ Eo[rp^] = E —. Denote the first mean by Un et the second 
by Vn. 


i=0 


i=0 


^1 11^1 

The lemma implies that > — < Un + Vat <-f + / ~- 

^ ao Po U 


i=l 


As in the discrete time, we have by (HSl), ao > Trx,^i({x* + 1, • • • , N})ti. 


N 


But 7rA,;,({x* + I, • • • , N}) ~ i, then t/jv + I4r ~ E 

i=l 

Lemma 4.7. (Un — Vn) is bounded. 


1 log(N) 
ti A + /r 


Proof. By using the equality proved in [3], chapter 2, lemma 12, 




with Zij = 


I'+oo 

L ' 


\(Xt = j) -TT(j))dt, 


we have 


1 /*+oo 

Un-Vn = -^ / (P^(E’^ = X*) - Po(E’'' = 

T^xA^ ) Jo 

By using formula (1371) . 

/*+oo 

Un-Vn= (1 + (1 - - (1 - 

Jo ^ 


x*))dt 

(l + -e-*V+u))y‘ dt 
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After the change of variables u = e ^ we obtain 


Un — Vn 


1 {I +- u)y'- {I 
\ + fl Jq U 


ur\i + ^u)y* 

- - - du 


Some calculus gives the result. □ 

Then as Un + Vn ^ Un — Vn is bounded, Un ^ 2 (a^) result comes from 

proposition a. 

□ 


5 Appendix 

5.1 Birth-and-death chain and orthogonal polynomials 

A reference for orthogonal polynomial and birth and death chain can be found in Karlin and 
McGregor ( [16], [15]). 

Denote by P the transition matrix on {0,..., N} and by Pk the restriction of P to {0,..., k} 
for 0 < k < N. 

For 1 < fc < A^, denote by (pk the polynomial given by (pk{t) = ^ det(t/fc — Pk-i)- Let 

= 1- 

So {ifQ ,..., pn) is a family of polynomials which satisfy the following recurrence equation : 

^k{t) = - -(28) 

Pk-i Pk-i 

Let 1 = Ao > Ai > • • • > Atv be the eigenvalues of P. 

It exists (/tq, ..., pn) & such that {po ,..., pn) are the orthogonal polynomials in 

N 

l],/.i) where p = PkS\^ ■ 

Furthermore pk{tfp{dt) = ^ = Po = 7r(0). 

We have for all n > 0, for all x, ?/ € X, 

y) = ^J2 K^^{>^k)Py{\k)pk (29) 

V ) r._n 


So the normalized in (tt) eigenvector 14 associated to the eigenvalue \k is given by 14 (x) = 



The zeros of the orthogonal polynomials have the following interlacing property : if m > n, 
there is a zero of pm between any two zeros of pn- 

So there is a zero of pN in each interval ]Afe+i,Afc[ and as pn(X) = have : 


VO < fc < At, 


(-l)Vw(Afc) > 0 (or (-l)'=I4(iV) > 0)) 


(30) 
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The case of a symmetric birth-and-death chain on {0,..., 2N + 1} Denote by Qi the 
matrix equal to Pm unless the entry Qi{N,N) that is rjv + Pn and by Ci(t) the characteristic 
polynomial of Qi. 

Denote by Q 2 the matrix equal to Pm unless the entry Q 2 {N,N) that is tm — Pm and by 
(72(t) the characteristic polynomial of Q 2 - 

Let 1 = /3o > /3i > • • ■ > /Sat be the eigenvalues of Qi. 

Let oo > cti > • • • > aM be the eigenvalues of Q 2 ■ 

Let rjo > r]i > ■ ■ ■ > r/M-i be the eigenvalues of Pm-i- 
Let 7 o > 7 i > • • • > 7 Ar be the eigenvalues of Pm ■ 

So by the interlacing property, for all 0 < fc < — 1, Pk+i < Vk < (3k, ol^+i < Vk < ctk and 

7fc+i <r]k < Ik- 

We have (po ■ ■ ■ PM-i)~^Ci(t) = (t - tm - PM)pN{t) - qMV’N-iit), (po ■ ■ ■ PM-i)~^C2{t) = 
(f - rM +PAr)pAr(t) - qM^N-iit) and PAr+l(t) = [t - rAr)(pAr(t) - qM(pM-l{t). 

So (po • • •pAr_i)“^(7i(afc) = -2pMPM{ak)- But as rjk < cck < Pk-i, {-l)^PM{ak) > 0. 

But Uk G]?7fc,/3fc[ or Uk G]/3fc,pfe_i[ and (—l)^(7i < 0 on the hrst interval and (—l)^(7i > 0 on 
the second interval. So we have for all 0 < fc < iV, Ofc < f3k- 

As (po • ■ ■pM-i)~^{Ci{t) + C 2 {t)) = 2ipM+i{t), Ci{'^k) = -C 2 {ik) so for all 0 < fc < TV - 1, 

rik < ak < Jk < Pk < Vk-i (31) 

with ? 7 _i = +00 et riM = — 00 . 

Suppose now that for all a; G X, Tj, = 0. 

In this case if k is odd then ipk is odd and if k is even, ipk is even. 

Furthermore C 2 {—t) = (—l)'^“^(7i(t) and so for all 0 < /c < TV, = —^M-k- So we have : 

( If TV = 2a + 1, 0 < |/3a+l| < Pa < |/3a+2| < • • • < < |/32a+l| < ,do = 1 ('on', 

[ If TV = 2a, 0 < Pa < |/3a+l| < Pa-1 < |/3a+2| <■■■< Pi < |,02a| < Po = ^ 

So by (1^ . we have in the case where TV is even : 

p2-+i(o,TV + l) = (33) 

' k=0 

and (pM+i(Pk) has the sign of (—1)^'. 


5.2 Proof of results on continuous time 

For each 0 < h, consider the Ti-skeleton discrete time Markov chain = X{nh), n G N. 

X(h) 

is a Markov chain with transition matrix initial distribution ttq and stationary 
distribution tt. 

n 

We denote the corresponding quantities ([7]) and dH) by '01^^ and ~ 'P’k^\^k^'^)) 

k=0 

Proposition 5.1. For all t ^ A, there exists (h„) —> 0 such that a.e. on D 



Jt 


Proof. For all x G X, 1 1 — > Tit(x) is analytic with derivative TitQix). 

By consequence Ax = {T > 0/7rt(x) = 7r(x)} is either equal to IR+ or contains only isolated 
points. 
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Let T = {x £ X/ Vt > 0, TTt[x) = 7r(a;)} and A = VJ^^'j-Ax- 
For alH > 0, ^ n [0, t] is a finite set. 

Let t ^ A. We take ^ 0 such that there exists e > 0 such that for all n > 0, for all 

a £ ^ n [0, t], d{a, hnN) < h]^^. 

We choose uj such that for all r > 1, Tr{ijj) ^ AVJ {t} where {Tr)r>o are the jump times of 
the continuous-time Markov chain {Xt)t>o- 

First case : Vs £ [0,f],7rs(Ws(w)) > tt{Xs{u}) 

So for all s £ [0,t], Xs{io) ^ T and as Tr{uj) ^ Vl U {t}, by continuity there exists a > 0 
such that for all s £ 7rs(Xs(a;)) — 7r(Xs(a;) > a. 


In4'‘")(w)=ln(l- 


tt{Xo{uj)) 

7ro{Xo{uj)) 


^ fiji ^ 

+ ^ In (l - ^[^"\Xkh,,{u}))j 


k^l 


So, we have, using continuity and derivability of i i—>■ 7rt{x) : 


lnjf")(cc)=ln 1- 


7r(Xo(a;)) 

7ro(Xo(a;)) 


hn 


(7r/t„fc A TT)Q{Xkh^{uj)) 

^ TTh„kiXkh„{uj)) - 7T{Xkh„{to)) 


0{hr, 


That gives the result in this case 


Second case : 3s £ [0,t],7rs(Xs(a;)) < 7r(Xs(u;) 

If there exists s £ [0,t] such that 7rs(Xs(u;)) < 7r(3fs(w)) then by continuity, = 0 

for n big enough. 

So we suppose that there exists sq £]0,t[ which satisfies TTsp(Xgg(w)) = 7r(Xso(a;)) and for 
all u £ [0,t], TTuiXuiuj)) > 7r(X„(w)). 

Let X = Xsaiuj) and r such that Triio) < sq < Tr+i{uj). 

We want to prove that 0. 

Let s„ £ /inN which satisfies |s„ — so| < . For n big enough, Triuj) < Sn — hn < Sn < 

Tr+i(w). 


We have < 

Define for n > 1, 




Rn = {x £ X/nsg{x) = T:{x),TTsgQ'^{x) = 0 for 1 < fc < n—1, (tTs^A 7r)(5^(a;) = 0 for 1 < /c < n} 


Let n > 2. We prove by iteration that 

X £ Rn=^ yy/q{y,x) ^0, y £ (tTso A n)Q^{y) < 0 and TTsoQ"^~^{y) > 0 (34) 

Let n> 2. We prove by iteration that 

X £ Rn and tTsoQ’^{x) = 0 =^\/y/q{y^x) ^Q,y £ Rn-i and 'rrsaQ'^~^{y) = 0 (35) 

For n big enough, 7rs„ (x) - (7rs„_/i„ A7r)e''"‘5(a;) = 7rs„ (x) -7r(x) + (tt^^A 7r)Q^(x). 

k>l 
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Define the following sets : 


V = {y£ X/TTsoiy) > TT{y)}, M = {y e X/nsoiy) < iriy)} 

A+ = {ye X/iTsaiy) = iriy), (7rso-7r)(y-) > 0}, A- = {y £ ^/iTsoiy) = 7r(y), {TTso-n){y-) < 0} 
So, if n is big enough and s = Sn — 

{7rsATT)Q^{x) = ^ Tr{y)qk{y,x)+ ^ ■Ks{y)qk{jj,x) = ^ {■Ks{y)--K{y))qk{jj,x) 

y€T UVUA+ yeMUA- yeMUA- 

So if we denote ai^k = E '^soQ''{y)qi{y,x), we have 

yeMuA^ 

(7rs„-/i„ A 7r)Q‘(x) = (tTsq A 7r)Q‘(x) + ^ and so 

r>l 


Tr(x)-(7rs^-h„A7r)e^^^(x) 


'‘1 (36) 

1>1 1>1 r>l 


We prove by induction that for n > 2, 


r iffc + Z<n—1 then akj = 0 
1 if fc + Z = n then ak,i > 0 


We prove by induction that for n > 2, 


(37) 


X € Rn and tTsoQ^{x) = 0 


if Zc + / < n then ak^i = 0 
if Zc + Z = n + 1 then (—< 0 


(38) 


Let 2Zco be the smallest integer Zc which satisfies TrsgQ^{x) ^ 0. We have TrsgQ'^^°{x) > 0. 

Let ki be the smallest integer k which satisfies (tTs,, A tt)Q^{x) ^ 0. 

If Zci < 2Zco then by (1551) 

7''(a()-(7rs„-/i„A7r)e^"^(a;) =-(7rsoA7r)Q'=i(a;)E_ ^ Elfll-^^1- E-q,; ^,+o(Zi^3) 


But as a; G Rki-i, using (IMl) . we have (tTs^ A7r)(5^ ^(i) <0 and using (1551) (—!)’'«;,r(x) < 0 
if Z + r = Zci and ai^r{x) = 0 if Z + r < Zci, so there exists C > 0 such that 

T^ix) - (7r«„-/i„ A7r)e'‘"‘5(2.) = Zi^iC + o(Zi^0 

If ki > 2ko, then x G i? 2 fco- By ((36l) . we have 

-xix) - (7rs„-/i„ A 7r)e^"^(a;) = hl^° | ^ ( 

y/,r> l,/+r—2fco 


- 1 ) 


Iw 


+ o(Z*f») 
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But ai^ 2 ko-i[x) = ^ {'KsoQ'^'^°~^){y)q{y,x). By ([Ml) and dSS]), if q{y,x) > 0, then 
y&A- 

y € i? 2 feo -2 and TTsoQ'^'"°~^{y) > 0. 

Furthermore if y G A+, then TTsoQ^'‘°~^{y) = 0- So ai^ 2 fco-i( 2 ;) = 'XsoQ^^°{x) > 0. 

And using again (pTHll . there exists C > 0 such that 

tt{x) - iTTs„-h„ A 7r)e^"'^(a;) = hl^°C + o{hl^°) 

The fact that T^s^ix) - t:{x) = tTso( x) ° + o((s„ - so)^^°) and |s„ - so| < hl+^ 

implies that goes to 0. 

□ 

We give now the proof of proposition (12.61) . 

Proof. We define for t > 0 and x G X, f{t, x) = F{T < t, Xt = x) and l{t, x) = P(T <t,Xt = x). 
This proposition is the consequence of the two following lemma. 

Lemma 5.1. l{t, x) = {nt A 7r)(x) 

Proof. By using the proposition (EH), we have 

l{t,x) = P(t/ > Jt,Xt = x) = lim ¥(U > jl^f],Xt = x) 

But hn[^] < t so, 

m > J[7j ,Xt=x) = nu > T[7}, =y,Xt = x) 

= Ey^xnu > 

-!> (tt* a 7r)(x) 

this by (El) and ([5]). □ 

Lemma 5.2. x) = §(t, x) - ^ l{t, y)q{y, x) 

yex 

It is true for all stopping times. 

□ 


5.3 Proof of lemma (14.311 

2d 

Lemma 5.3. Let r„ = off.Bk where 

k=0 

Bq > 0, \/iB2iB2i+l < 0, B2i+lB2i+2 >0, Oq = 1 > Ol > 02 > ■ • ■ > Q!2d > 0 (39) 

FFe suppose that Tq = Ti = • • • = T^-i = 0. Then n ^-AVn is stricly increasing on {d,d+ 1, • • • }. 
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2d 

Proof. We prove the result inductively by the study of the real function (p{t) = Bo + ^ 

k=l 

where /ife = — In . 

We prove that if (IHH is satisfied then 

• If c? is even, there exists 1 < fc < |, there exists < ... < t 2 k-i such that 
VI < f < fc, (/3 is strictly increasing on [t 2 i-i,t 2 i[ 

V0<i<fc—l,(/3is strictly decreasing on [t 2 i,i 2 i+i[ 
where to = —oo,t 2 fc = +oo. 

• If d is odd, there exists 1 < A: < there exists ti < ... < t 2 k such that 
yO < i < k, ip is strictly increasing on [t 2 i,t 2 i+i[ 

\/l < i < k, if is strictly decreasing on [t 2 i-i,t 2 i[ 
where to = —oo,t 2 fe+i = +oo. 


It is easy to see that if d = 1, is strictly increasing. 

2(d+l) 

If d is odd and ip(t) = Bq + Bke~^'°^^ we write ip'{t) = and ip'it) = 

k=l 

9 satisfy the same hypothesis so there exists 1 < fc < , there exists h < ■ ■ ■ < t 2 k such 

that 

VO < t < fc, d is strictly increasing on [t 2 i,t 2 i+i[ 

VI < t < fc, d is strictly decreasing on [t 2 i-i,t 2 i[ 
where to = —oo,t 2 k+i = +oo. 

Furthermore lim d(t) = —i32d2(d2 ~ di) > 0- 

t—>-+oo 

The number of zeros of d is odd and smaller than 2fc + 1 . So there exists I < fc, there exists 
Si < • • • < S 21+1 such that 

d < 0 on ]s 2 i,S 2 i+i[ for 0 < t < /, sq = —oo. 
d > 0 on ]s 2 i+i, S 2 i+ 2 [ for 0 < t < /, S 2/+2 = + 00 . 

Furthermore lim i/jlt) = Bipi < 0. 

>- + oo 

So the number of zeros of ip is odd and smaller than 2/ + 1. So there exists m < I, there exists 
ri < • • • < r 2 m+i such that 

■0 > 0 on ]r 2 i, r 2 i+i[ for 0 < t < m, tq = — 00 . 

< 0 on ]r 2 i+i,r 2 i+ 2 [ for 0 < t < m, r 2 m +2 = + 00 . 

As m < I < k < we have proved the result for ip. 

The case where d is even is the same. 

2d 

The preceding result prove that (p{t) = Bq + ^ Bke~^’^^ has at most d zeros. So if p{0) = 

(^(1) = ... = (^(d — I) = 0, t ip{t) is strictly increasing on ]d — 1, +oo[ and this proves the 
lemma. 

□ 


5.4 Proof of lemma (14.51) 

d 

Lemma 5.4. Let 1 = Aq > Ai > ... > > 0 and r„ = ^ AiXf, 4o > 0 

i=0 
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If for 0 < i < d — 1 , ri = 0, then n — > r„ is increasing and so for all n > 0, r„ < Aq. 

Proof. We show the result inductively. 

Ifd=l, r„ = Ao(l-A?). 

We suppose the result true for d — 1. We can suppose that for all 0 < i < d, 0 et denote 
by e the sign of Ai. 

Then for all n > 0, r„+i — r„ = eA”d„, where 


r /_1 



Furthermore, Oq = ■ ■ ■ = 6 d -2 = 0, so by hypothesis, n —>■ is increasing, so as = 0, for all 

n > 0, > 0 and n —> er„ is increasing. 

So 0 = eFo < er„ < limji_),+oo eF^ = cAq. 


As Aq > 0, then e = 1. this completes the proof of the lemma. 


□ 
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